SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(I) APPLICANT: NAKAMURA, SEIJI 
SAKURAI, TAKASHI 
NEZU, JUNI-ICHI 

(ii) TITLE OF INVENTION: GENE ENCODING ADSEVERIN 

(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Birch, Stewart, Kolasch & Birch, LLP 

(B) STREET: P.O. Box 747 

(C) CITY: Falls Church 

(D) STATE: VA 

(E) COUNTRY: USA 

(F) ZIP: 22040-0747 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION : 

(Viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: MURPHY Jr., Gerald M. 

(B) REGISTRATION NUMBER: 28,977 

(C) REFERENCE/DOCKET NUMBER: 230-lldP 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 205-8000 

(B) TELEFAX: (703) 205-8050 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Lys Val Ala His Val Lys Gin He Pro Phe Asp Ala 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Val Leu Thr Asn Asp Leu Thr Ala Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii.) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

lie Thr Asn Arg Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 27,. 2171 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CGGCCGGAAC ATCGCGTGCC CGAGTC ATG GCC CAG GGG CTG TAC CAC GAG GAG 

Met Ala Gin Gly Leu Tyr His Glu Glu 
1 5 

TTC GCC CGC GCG GGC AAG CGG GCG -GGG CTG CAG GTC TGG AGA ATT GAG 
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Phe Ala Arg Ala Gly Lys Arg Ala Gly Leu Gin Val Trp Arg He Glu 
10 15 20 25 

AAG CTG GAG CTG GTG CCG GTG CCC GAG AGC GCG TAT GGC AAC TTC TAG 14 9 

Lvs Leu Glu Leu Val Pro Val Pro Glu Ser Ala Tyr Gly Asn Phe Tyr 
30 35 40 

GTC GGG GAT GCC TAG CTG GTG CTC CAC ACG ACG CAG GCC AGC CGG GGC 197 
Val Gly Asp Ala Tyr Leu Val Leu His Thr Thr Gin Ala Ser Arg Gly 
45 50 55 

TTC ACC TAC CGC CTG CAC TTC TGG CTG GGA AAG GAG TGT ACT CAG GAT 24 5 

Phe Thr Tvr Arg Leu His Phe Trp Leu Gly Lys Glu Cys Thr Gin Asp 
60 65 70 

GAA AGC ACA GCA GCT GCC ATC TTT ACT GTT CAG ATG GAT GAG TAT TTG 293 
Glu Ser Thr Ala Ala Ala He Phe Thr Val Gin Met Asp Asp Tyr Leu 
75 80 85 

GGT GGC AAA CCT GTG CAG AAC AGA GAA CTT CAA GGC TAT GAG TCT ACG 341 
Gly Gly Lys Pro Val Gin Asn Arg Glu Leu Gin Gly Tyr Glu Ser Thr 
90 95 100 105 

GAT TTT GTT GGC TAC TTT AAA GGA GGT CTG AAA TAC AAG GCT GGC GGT 38 9 

Asp Phe Val Gly Tyr Phe Lys Gly Gly Leu Lys Tyr Lys Ala Gly Gly 
110 115 120 

GTG GCG TCT GGA CTC AAT CAT GTG CTT ACA AAT GAC TTG ACT GCT CAG 4 37 

Val Ala: Ser Gly Leu Asn His Val Leu Thr Asn Asp Leu Thr Ala Gin 
125 130 135 

AGG CTC CTG CAT GTG AAA GGT CGG AGA GTC GTC AGG GCC ACG GAA GTT 4 85 

Arg. Leu Leu His Val Lys Gly Arg Arg Val Val Arg Ala Thr Glu Val 
140 145 150 

CCC CTA AGC TGG GAC AGT TTC AAC AAG GGT GAC TGC TTC ATC ATT GAC 533 
Pro Leu Ser Trp Asp Ser Phe Asn Lys Gly Asp Cys Phe He He Asp 
155 160 165 

CTT GGC ACT GAA ATT TAC CAG TGG TGT GGA TCT TCT TGC AAC AAG TAC 581 
Leu Gly Thr Glu He Tyr Gin Trp Cys Gly Ser Ser Cys Asn Lys Tyr 
170 175 180 185 

GAG CGC CTG AAG GCC AGC CAG GTT GCC ATC GGC ATT CGG GAC AAT GAA 62 9 

Glu Arg Leu Lys Ala Ser Gin Val Ala He Gly He Arg Asp Asn Glu 
190 195 200 

AGG AAA GGC AGA GCT CAG CTG ATT GTG GTA GAA GAA GGG AGT GAA CCA 677 
Arg Lys Gly Arg Ala Gin Leu He Val Val Glu Glu Gly Ser Glu Pro 
205 210 215 

TCA GAG CTT ACA AAG GTA TTA GGG GAA AAG CCA AAG CTT AGG GAT GGA 72 5 

Ser Glu Leu Thr Lys Val Leu Gly Glu Lys Pro Lys Leu Arg Asp Gly 
220 225 230 

GAA GAT GAT GAT GAC ATC AAA GCA GAT ATA ACT AAT AGG AAA ATG GCT 77 3 

Glu Asp Asp Asp Asp He Lys Ala Asp He Thr Asn Arg Lys Met Ala 
235 240 245 

AAA CTC TAC ATG GTT TCA GAT GCC AGT GGC TCC ATG AAA GTG AGT CTG 821 
Lys Leu Tyr Met Val Ser Asp Ala Ser Gly Ser Met Lys Val Ser Leu 
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250 



255 



260 



265 



GTG GCA GAA GAA AAC CCC TTC TCC ATG GCG ATG CTT CTG TCT GAA GAA 869 
Val Ala Glu Glu Asn Pro Phe Ser Met Ala Met Leu Leu Ser Glu Glu 
270 275 280 

TGC TTC ATT TTG GAC CAC GGT GCT GCA AAA CAG ATT TTT GTA TGG AAA 917 
Cys Phe lie Leu Asp His Gly Ala Ala Lys Gin He Phe Val Trp Lys 
285 290 295 

GGT AAA GAT GCT AAT CCC CAG GAG AGA AAG GCT GCC ATG AAG ACA GCT 965 
Gly Lys Asp Ala Asn Pro Gin Glu Arg Lys Ala Ala Met Lys Thr Ala 
300 305 310 

GAG GAA TTC CTA CAG CAA ATG AAT TAT TCT ACG AAT ACC CAA ATT CAA 1013 
Glu Glu Phe Leu Gin Gin Met Asn Tyr Ser Thr Asn Thr Gin He Gin 
315 320 325 

GTT CTT CCA GAA GGA GGT GAA ACA CCA ATC TTC AAA CAG TTC TTT AAG 1061 
Val Leu Pro Glu Gly Gly Glu Thr Pro He Phe Lys Gin Phe Phe Lys 
330 335 340 345 

GAC TGG AGA GAT AGA GAT CAG AGC GAT GGC TTC GGG AAA GTG TAT GTC 1109 
Asp Trp Arg Asp Arg Asp Gin Ser Asp Gly Phe Gly Lys Val Tyr Val 
350 355 360 

ACA GAA AAA GTG GCT CAC GTA AAA CAA ATT CCA TTT GAT GCC TCA AAA 1157 
Thr Glu Lys Val Ala His Val Lys Gin He Pro Phe Asp Ala Ser Lys 
365 370 375 

TTG CAC AGC TCC CCA CAA ATG GCA GCC CAG CAT CAC GTG GTG GAT GAC 1205 
Leu His Ser Ser Pro Gin Met Ala Ala Gin His His Val Val Asp Asp 
380 385 390 

GGT TCT GGC AAA GTG CAG ATT TGG CGT GTA GAA AAC AAC GGT AGG GTC 1253 
Gly Ser Gly Lys Val Gin He Trp Arg Val Glu Asn Asn Gly Arg Val 
395 400 405 

GAA ATT GAC CGA AAC TCG TAT GGT GAA TTC TAT GGT GGT GAT TGC TAC 1301 
Glu He Asp Arg Asn Ser Tyr Gly Glu Phe Tyr Gly Gly Asp Cys Tyr 
410 415 420 425 

ATT ATA CTT TAC ACT TAT CCC AGA GGA CAG ATT ATC TAC ACC TGG CAA 134 9 

He He Leu Tyr Thr Tyr Pro Arg Gly Gin He He Tyr Thr Trp Gin 
430 435 440 

GGA GCA AAT GCC ACA CGG GAT GAG CTG ACA ACC TCC GCA TTC CTG ACT 1397 
Gly Ala Asn Ala Thr Arg Asp Glu Leu Thr Thr Ser Ala Phe Leu Thr 
445 450 455 

GTT CAG TTG GAT AGA TCC CTC GGG GGA CAG GCT GTG CAG ATT CGA GTC 14 4 5 

Val Gin Leu Asp Arg Ser Leu Gly Gly Gin Ala Val Gin He Arg Val 
460 465 470 

TCC CAA GGC AAA GAA CCT GCT CAC CTG CTG AGT TTG TTC AAA GAC AAA 14 93 

Ser Gin Gly Lys Glu Pro Ala His Leu Leu Ser Leu Phe Lys Asp Lys 
. 475 480 485 

CCG CTC ATT ATT TAC AAG AAC GGA ACA TCA AAG AAA GAA GGT CAG GCA 1541 
Pro Leu He He Tyr Lys Asn Gly Thr Ser Lys Lys Glu Gly Gin Ala 
490 495 500 505 
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CCA GCC CCC CCT ATA CGC CTC TTT CAA GTC CGA AGA AAC CTG GCT TCG 1589 
Pro Ala Pro Pro lie Arg Leu Phe Gin Val Arg Arg Asn Leu Ala Ser 
510 515 520 

ATC ACC AGA ATT ATG GAG GTA GAT GTT GAT GCA AAC TCA TTG AAT TCC 1637 
He Thr Arg He Met Glu Val Asp Val Asp Ala Asn Ser Leu Asn Ser 
525 530 535 

AAT GAT GTT TTT GTC CTG AAA CTG CGA CAA AAT AAT GGC TAC ATC TGG 1685 
Asn Asp Val Phe Val Leu Lys Leu Arg Gin Asn Asn Gly Tyr He Trp 
540 545 550 

ATA GGA AAA GGC TCC AC A CAG GAG GAG GAG AAA GGA GCA GAG TAC GTG 1733 
He Gly Lys Gly Ser Thr Gin Glu Glu Glu Lys Gly Ala Glu Tyr Val 
555 560 565 

GCA AGC GTC CTC AAA TGC AAA ACT TCG ACG ATT CAG G7^ GGC AAG GAA 17 81 

Ala Ser Val Leu Lys Cys Lys Thr Ser Thr He Gin Glu Gly Lys Glu 
570 575 580 585 

CCA GAG GAG TTT TGG AAT TCC CTT GGA GGG AAA AAA GAC TAC CAG ACC 1829 
Pro Glu Glu Phe Trp Asn Ser Leu Gly Gly Lys Lys Asp Tyr Gin Thr 
590 595 600 

TCT CCT CTG CTA GAA TCC CAG GCT GAA GAC CAT CCA CCT CGG CTT TAC 1877 
Ser Pro Leu Leu Glu Ser Gin Ala Glu Asp His Pro Pro Arg Leu Tyr 
605 610 615 

GGC TGC TCC AAC AAA ACT GGA AGA TTC ATT ATT GAA GAG GTT CCA GGA 1925 
Gly Cys Ser Asn Lys Thr Gly Arg Phe He He Glu Glu Val Pro Gly 
620 625 630 

GAG TTC ACC CAG GAT GAT TTA GCA GAA GAT GAT GTC ATG CTG TTA GAT 1973 
Glu Phe Thr Gin Asp Asp Leu Ala Glu Asp Asp Val Met Leu Leu Asp 
635 640 645 

GCT TGG GAA CAG ATT TTT ATT TGG ATT GGA AAA GAT GCC AAT GAA GTT 2021 
Ala Trp Glu Gin He Phe He Trp He Gly Lys Asp Ala Asn Glu Val 
650 655 660 665 

GAG AAA TCA GAA TCT CTG AAG TCT GCC AAA ATA TAC CTT GAG ACC GAC 20 69 

Glu Lys Ser Glu Ser Leu Lys Ser Ala Lys He Tyr Leu Glu Thr Asp 
670 675 680 

CCT TCT GGA AGA GAC AAG AGG ACG CCA ATT GTC ATC ATA AAA CAG GGT 2117 
Pro Ser Gly Arg Asp Lys Arg Thr Pro He Val He He Lys Gin Gly 
685 690 695 

CAT GAG CCA CCT ACT TTC ACA GGC TGG TTC CTG GGC TGG GAT TCC AGC 2165 
His Glu Pro Pro Thr Phe Thr Gly Trp Phe Leu Gly Trp Asp Ser Ser 
700 705 710 

AGG TGG TAAACTGATT TTTGTAGGAA AAAAACAAAT AT/^TGGGGC AGCTGTCCCA 2221 
Arg Trp 
715 

GGGGGGAAGG AGGAGCTTGT TTAACTTTAG AAAATTAACC TCAGCCATAT GGCTATTTTT 2281 
CCGTGCTTAG AATTGGTTTG AAATTTCTTT TAAACTGGAA TTTTCTTATG TTAATATTTT 2 341 
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TAT.AACTTTT CTTATGGACC AATATTAGCT CTGCTGGATG CTGACATATC TTTATATATG 24 01 
ACTTTTTAAA GGGGCCG ^^^^ 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 amino acids 

(B) TYPE: amino acid • 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ IDN0:5: 

Met Ala Gin Gly Leu Tyr His Glu Glu Phe Ala Arg Ala Gly Lys Arg 
1 5 10 15 

Ala Gly Leu Gin Val Trp Arg He Glu Lys Leu Glu Leu Val Pro Val 
20 25 30 

Pro Glu Ser Ala Tyr Gly Asn Phe Tyr Val Gly Asp Ala Tyr Leu Val 
35 40 45 

Leu His Thr Thr Gin Ala Ser Arg Gly Phe Thr Tyr Arg Leu His Phe 
50 55 60 

Trp Leu: Gly Lys Glu Cys Thr Gin Asp Glu Ser Thr Ala Ala Ala He 
65 70 75 80 

Phe Thr Val Gin Met Asp Asp Tyr Leu Gly Gly Lys Pro Val Gin Asn 
85 90 95 

Arg Glu Leu Gin Gly Tyr Glu Ser Thr Asp Phe Val Gly Tyr Phe Lys 
100 105 110 

Gly Gly Leu Lys Tyr Lys Ala Gly Gly Val Ala Ser Gly Leu Asn His 
115 120 125 

Val Leu Thr Asn Asp Leu Thr Ala Gin Arg Leu Leu His Val Lys Gly 
130 135 140 

Arg Arg Val Val Arg Ala Thr Glu Val Pro Leu Ser Trp Asp Ser Phe 
145 150 155 160 

Asn Lys Gly Asp Cys Phe He He Asp Leu Gly Thr Glu He Tyr Gin 
165 170 175 

Trp Cys Gly Ser Ser Cys Asn Lys Tyr Glu Arg Leu Lys Ala Ser Gin 
180 185 190 

Val Ala He Gly He Arg Asp Asn Glu Arg Lys Gly Arg Ala Gin Leu 
195 200 205 

He Val Val Glu Glu Gly Ser Glu Pro Ser Glu Leu Thr Lys Val Leu 
210 215 220 

Gly Glu Lys Pro Lys Leu Arg Asp Gly Glu Asp Asp Asp Asp He Lys 
225 230 235 240 
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Ala Asp lie Thr Asn Arg Lys Met Ala Lys Leu Tyr Met Val Ser Asp 
245 250 255 

Ala Ser Gly Ser Met Lys Val Ser Leu Val Ala Glu Glu Asn Pro Phe 
260 265 270 

Ser Met Ala Met Leu Leu Ser Glu Glu Cys Phe He Leu Asp His Gly 
275 280 285 

Ala Ala Lys Gin He Phe Val Trp Lys Gly Lys Asp Ala Asn Pro Gin 
290 295 300 

Glu Arg Lys Ala Ala Met Lys Thr Ala Glu Glu Phe Leu Gin Gin Met 
305 • 310 . 315 320 

Asn Tyr Ser Thr Asn Thr Gin He Gin Val Leu Pro Glu Gly Gly Glu 
325 330 335 

Thr Pro lie Phe Lys Gin Phe Phe Lys Asp Trp Arg Asp Arg Asp Gin 
340 345 350 

Ser Asp Gly Phe Gly Lys Val Tyr Val Thr Glu Lys Val Ala His Val 
355 360 365 

Lys Gin He Pro Phe Asp Ala Ser Lys Leu His Ser Ser Pro Gin Met 
370 375 380 

Ala Ala Gin His His Val Val Asp Asp Gly Ser Gly Lys Val Gin He 
385 390 395 400 

Trp Arg Val Glu Asn Asn Gly Arg Val Glu He Asp Arg Asn Ser Tyr 
405 410 415 

Gly Glu Phe Tyr Gly Gly Asp Cys Tyr He He Leu Tyr Thr Tyr Pro 
420 425 430 

Arg Gly Gin He He Tyr Thr Trp Gin Gly Ala Asn Ala Thr Arg Asp 
435 440 445 

Glu Leu Thr Thr Ser Ala Phe Leu Thr Val Gin Leu Asp Arg Ser Leu 
450 455 460 

Gly Gly Gin Ala Val Gin He Arg Val Ser Gin Gly Lys Glu Pro Ala 
465 470 475 480 

His Leu Leu Ser Leu Phe Lys Asp Lys Pro Leu He He Tyr Lys Asn 
485 490 495 

Gly Thr Ser Lys Lys Glu Gly Gin Ala Pro Ala Pro Pro He Arg Leu 
500 505 510 

Phe Gin Val Arg Arg Asn Leu Ala Ser He Thr Arg He Met Glu Val 
515 520 525 

Asp Val Asp Ala Asn Ser Leu Asn Ser Asn Asp Val Phe Val Leu Lys 
530 535 540 

Leu Arg Gin Asn Asn Gly Tyr He Trp He Gly Lys Gly Ser Thr Gin 
545 550 555 560 

Glu Glu Glu Lys Gly Ala Glu Tyr Val Ala Ser Val Leu Lys Cys Lys 
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565 



570 575 



Thr Ser Thr lie Gin Glu Gly Lys Glu Pro Glu Glu Phe Trp Asn Ser 
580 585 590 

Leu Gly Gly Lys Lys Asp Tyr Gin Thr Ser Pro Leu Leu Glu Ser Gin 
595 600 605 

Ala Glu Asp His Pro Pro Arg Leu Tyr Gly Cys Ser Asn Lys Thr Gly 
610 615 620 

Arg Phe lie lie Glu Glu Val Pro Gly Glu Phe Thr Gin Asp Asp Leu 
625 630 635 640 

Ala Glu Asp Asp Val Met Leu Leu Asp Ala Trp Glu Gin He Phe He 
645 650 655 

Trp He Gly Lys Asp Ala Asn Glu Val Glu Lys Ser Glu Ser Leu Lys 
660 665 670 

Ser Ala Lys He Tyr Leu Glu Thr Asp Pro Ser Gly Arg Asp Lys Arg 
675 680 685 

Thr Pro He Val He He Lys Gin Gly His Glu Pro Pro Thr Phe Thr 
690 695 700 

Gly Trp Phe Leu Gly Trp Asp Ser Ser Arg Trp 
705 710 715 

(2) INFORMATION FOR SEQ ID NO: 6: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 79.. 2223 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AAGGTTCCTC CTGCTGCTCT CGGTTTAGTC CAAGATCAGC GATATCACGC GTCCCCCGGA 60 

GCATCGCGTG CAGGAGCC ATG GCG CGG GAG CTA TAG CAC GAA GAG TTC GCC 111 

Met Ala Arg Glu Leu Tyr His Glu Glu Phe Ala 
1 5 10 

CGG GCG GGC AAG CAG GCG GGG CTG CAG GTC TGG AGG ATT GAG AAG CTG 159 
Arg Ala Gly Lys Gin Ala Gly Leu Gin Val Trp Arg He Glu Lys Leu 
15 20 25 

GAG CTG GTG CCC GTG CCC CAG AGC GCT CAC GGC GAC TTC TAC GTC GGG 207 
Glu Leu Val Pro Val Pro Gin Ser Ala His Gly Asp Phe Tyr Val Gly 
30 35 40 
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GAT GCC TAG CTG GTG CTG CAC ACG GCC AAG ACG AGC CGA GGC TTC AGO 255 
Asp Ala Tyr Leu Val Leu His Thr Ala Lys Thr Ser Arg Gly Phe Thr 
45 50 55 

TAG CAC CTG CAC TTC TGG CTG GGA AAG GAG TGT TCC CAG GAT GAA AGC 303 
Tvr His Leu His Phe Trp Leu Gly Lys Glu Cys Ser Gin Asp Glu Ser 
60 65 70 75 

ACA GCT GCT GCC ATC TTC ACT GTT CAG ATG GAT GAC TAT TTG GGT GGC 351 
Thr Ala Ala Ala He Phe Thr Val Gin Met Asp Asp Tyr Leu Gly Gly 
80 85 90 

AAG CCA GTG CAG AAT AGA GAA CTT CAA GGA TAT GAG TCT AAT GAC TTT 399 
Lvs Pro Val Gin Asn Arg Glu Leu Gin Gly Tyr Glu Ser Asn Asp Phe 
95 100 105 

GTT AGC TAT TTC AAA GGC GGT CTG AAA TAG AAG GCT GGA GGC GTG GCA 4 47 

Val Ser Tyr Phe Lys Gly Gly Leu Lys Tyr Lys Ala Gly Gly Val Ala 
110 115 120 

TCT GGA TTA AAT CAT GTT CTT ACG AAC GAC CTG ACA GCC AAG AGG CTC 4 95 

Ser Gly Leu Asn His Val Leu Thr Asn Asp Leu Thr Ala Lys Arg Leu 
125 130 135 

CTA CAT GTG AAG GGT CGT AGA GTG GTG AGA GCC ACA GAA GTT CCC CTT 54 3 

Leu His Val Lys Gly Arg Arg Val Val Arg Ala Thr Glu Val Pro Leu 
140 145 150 155 

AGC TGG GAC AGT TTC AAC AAG GGT GAC TGC TTC ATC ATT GAC CTT GGC 591 
Ser Trp Asp Ser Phe Asn Lys Gly Asp Cys Phe He He Asp Leu Gly 
160 165 170 

ACC GAA ATT TAT CAG TGG TGT GGT TCC TCG TGC AAC AAA TAT GAA CGT 639 
Thr Glu He Tyr Gin Trp Cys Gly Ser Ser Cys Asn Lys Tyr Glu Arg 
175 180 185 

CTG AAG GCA AAC CAG GTA GCT ACT GGC ATT CGG TAG AAT GAA AGG AAA 687 
Leu Lys Ala Asn Gin Val Ala Thr Gly He Arg Tyr Asn Glu Arg Lys 
190 195 200 

GGA AGG TCT GAA CTA ATT GTC GTG GAA GAA GGA AGT GAA CCC TCA GAA 7 35 

Gly Arg Ser Glu Leu He Val Val Glu Glu Gly Ser Glu Pro Ser Glu 
205 210 215 

CTT ATA AAG GTC TTA GGG GAA AAG CCA GAG CTT CCA GAT GGA GGT GAT 7 83 

Leu He Lys Val Leu Gly Glu Lys Pro Glu. Leu Pro Asp Gly Gly Asp 
220 225 .230 235 

GAT GAT GAC ATT ATA GCA GAC ATA AGT AAC AGG AAA ATG GCT AAA CTA 8 31 

Asp Asp Asp He He Ala Asp He Ser Asn Arg Lys Met Ala Lys Leu 
240 245 250 

TAG ATG GTT TCA GAT GCA AGT GGC TCC ATG AGA GTG ACT GTG GTG GCA 87 9 

Tyr Met Val Ser Asp Ala Ser Gly Ser Met Arg Val Thr Val Val Ala 
255 260 265 

GAA GAA AAC CCC TTC TCA ATG GCA ATG CTG CTG TCT GAA GAA TGC TTT 927 
Glu Glu Asn Pro Phe Ser Met Ala Met Leu Leu Ser Glu Glu Cys Phe 
270 275 280 
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ATT TTG GAC CAC GGG GCT GCC AAA CAA ATT TTC GTA TGG AAA GGT AAA 97 5 

He Leu Asp His Gly Ala Ala Lys Gin He Phe Val Trp Lys Gly Lys 
285 290 296 

GAT GCT AAT CCC CAA GAG AGG AAG GCT GCA ATG AAG ACA GCT GAA GAA 1023 
ASP Ala Asn Pro Gin Glu Arg Lys Ala Ala Met Lys Thr Ala Glu Glu 
300 305 310 315 

TTT CTA CAG CAA ATG AAT TAT TCC AAG AAT ACC CAA ATT CAA GTT CTT 1071 
Phe Leu Gin Gin Met Asn Tyr Ser Lys Asn. Thr Gin He Gin Val Leu 
320 325 330 

CCA GAA GGA GGT GAA ACA CCA ATC TTC AAA CAG TTT TTT AAG GAC TGG 1119 
Pro Glu Gly Gly Glu Thr Pro He Phe Lys Gin Phe Phe Lys Asp Trp 
335 340 345 

AGA GAT AAA GAT CAG AGT GAT GGC TTC GGG AAA GTT TAT GTC ACA GAG 1167 
Arg Asp Lys Asp Gin Ser Asp Gly Phe Gly Lys Val Tyr Val Thr Glu 
350 355 360 

AAA GTG GCT CAA ATA AAA CAA ATT CCC TTT GAT GCC TCA AAA TTA CAC 1215 
Lys Val Ala Gin He Lys Gin He Pro Phe Asp Ala Ser Lys Leu His 
365 370 375 

AGT TCT CCG CAG ATG GCA GCC CAG CAC AAT ATG GTG GAT GAT GGT TCT 1263 
Ser Ser Pro Gin Met Ala Ala Gin His Asn Met Val Asp Asp Gly Ser 
380 385 390 395 

■ GGC AAA GTG GAG ATT TGG CGT GTA GAA AAC AAT GGT AGG ATC CAA GTT 1311 
Glv Lys Val Glu He Trp Arg Val Glu Asn Asn Gly Arg He Gin Val 
400 405 410 

GAC CAA AAC TCA TAT GGT GAA TTC TAT GGT GGT GAC TGC TAC ATC ATA 1359.. 
Asp Gin Asn Ser Tyr Gly Glu Phe Tyr Gly Gly Asp Cys Tyr He He 
415 420 425 

CTC TAC ACC TAT CCC AGA GGA CAG ATT ATC TAC ACG TGG CAA GGA GCA 14 07 

Leu Tyr Thr Tyr Pro Arg Gly Gin He He Tyr Thr Trp Gin Gly Ala' 
430 435 440 

AAT GCC ACA CGA GAT GAG CTG ACA ACA TCT GCG TTC CTG ACT GTT CAG 14 55 

Asn Ala Thr Arg Asp Glu Leu Thr Thr Ser Ala Phe Leu Thr Val Gin 
445 450 455 

TTG GAT CGG TCC CTT GGA GGA CAG GCT GTG CAG ATC CGA GTC TCC CAA 1503 
Leu Asp Arg Ser Leu Gly Gly Gin Ala Val Gin He Arg Val Ser Gin 
460 465 470 475 

GGC AAA GAG CCT GTT CAC CTA CTG AGT TTG TTC AAA GAC AAA CCG CTC 1551 
Glv Lys Glu Pro Val His Leu Leu Ser Leu Phe Lys Asp Lys Pro Leu 
480 485 490 

ATT ATT TAC AAG AAT GGA ACA TCA AAG AAA GGA GGT CAG GCA CCT GCT 1599 
He He Tyr Lys Asn Gly Thr Ser Lys Lys Gly Gly Gin Ala Pro Ala 
495 500 505 

CCC CCT ACA CGC CTC TTT CAA GTC CGG AGA AAC CTG GCA TCT ATC ACC 164 7 

Pro Pro Thr Arg Leu Phe Gin Val Arg Arg Asn Leu Ala Ser He Thr 
510 515 520 

AGA ATT GTG GAG GTT GAT GTT GAT GCA AAT TCA CTG AAT TCT AAC GAT 1695 
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Arg He Val Glu Val Asp Val Asp Ala Asn Ser Leu Asn Ser Asn Asp 
525 530 535 

GTT TGT GTC CTG AAA CTG CCA CAA AAT AGT GGC TAC ATC TGG GTA GGA 17 4 3 

Val Cys Val Leu Lys Leu Pro Gin Asn Ser Gly Tyr He Trp Val Gly 
540 545 550 555 

AAA GGT GCT AGC CAG GAG GAG GAG AAA GGA GCA GAG TAT GTA GCA AGT 17 91 

Lys Gly Ala Ser Gin Glu Glu Glu Lys Gly. Ala Glu Tyr Val Ala Ser 
560 565 570 

GTC CTA AAG TGC AAA ACC TTA AGG ATC CAA GAA GGC GAG GAG CCA GAG 1839 
Val Leu Lys Cys Lys Thr Leu Arg He Gin Glu Gly Glu Glu Pro Glu 
575 580 585 

GAG TTC TGG AAT TCC CTT GGA GGG AAA AAA GAC TAC CAG ACC TCA CCA 1887 
Glu Phe Trp Asn Ser Leu Gly Gly Lys Lys Asp Tyr Gin Thr Ser Pro 
590 595 600 

CTA CTG GAA ACC CAG GCT GAA GAC CAT CCA CCT CGG CTT TAC GGC TGC 1935 
Leu Leu Glu Thr Gin Ala Glu Asp His Pro Pro Arg Leu Tyr Gly Cys 
605 610 615 

TCT AAC AAA ACT GGA AGA TTT GTT ATT GAA GAG ATT CCA GGA GAG TTC 198 3 

Ser Asn Lys Thr Gly Arg Phe Val He Glu Glu He Pro Gly Glu Phe 
620 625- 630 635 

ACC CAG GAT GAT TTA GCT GAA GAT GAT GTC ATG TTA CTA GAT GCT TGG 2031 
Thr Gin Asp Asp Leu Ala Glu Asp Asp Val Met Leu Leu Asp Ala Trp 
640 645 650 

GAA CAG ATA TTT ATT TGG ATT GGC AAA GAT GCT AAT GAA GTT GAG AAA 207 9 

Glu. Gin He Phe He Trp He Gly Lys Asp Ala Asn Glu Val Glu Lys 
655 660 665 

AAA GAA TCT CTG AAG TCT GCC AAA ATG TAC CTT GAG ACA GAC CCT TCT 2127 
Lys Glu Ser Leu Lys Ser Ala Lys Met Tyr Leu Glu Thr Asp Pro Ser 
670 675 680 

GGA AGA GAC AAG AGG ACA CCA ATT GTC ATC ATA AAA CAG GGC CAT GAG 217 5 

Gly Arg Asp Lys Arg Thr Pro He Val He He Lys Gin Gly His Glu 
685 690 695 

CCA CCC ACA TTC ACA GGC TGG TTC CTG GGC TGG GAT TCC AGC AAG TGG 2223 
Pro Pro Thr Phe Thr Gly Trp Phe Leu Gly Trp Asp Ser Ser Lys Trp 
700 705 710 715 

TAAATTGGTA TTTGTAAAAA GCAAACAAAC ATTACAAGGC AGTTATCTCA TTGCTGTTTT 22 83 

GGGAGAGGAA CGGGAAAAGC TTTTTGCTTA TTTGTCTTTT GAAAATTAAG GCTGGGCGCG 234 3 

GTGGCTCACA CCTGTAATCC CAGCACTTTG AGAGGATGAG GTAGGCGGAT CACTGGGGTC 24 03 

AGGATTTCGA GACCAGCCTG GCCAACATGG CGAAACCTCG CCTCTACTAA AAATACAAAA 24 63 

AAATTAGCTG CGCGTGGTGG TGCACGCCTG TAGTCCCTGC TACTTGGAAG GCTGAGACAG 2523 

GAAAATTGCT TGAGCCCAGG AGGCTGAGGT TGCAGTGAGC CAGGATTGCG CCACCACACT 2 583 

CCAGCCTGGG CAACAGAGAC TCTGTCTCAA AAAAAAAAAA AAAAAAA 2 630 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Ala Arg Glu Leu Tyr His Glu Glu Phe Ala Arg Ala Gly Lys Gin 
1 5 10 15 

Ala Gly Leu Gin Val Trp Arg He Glu Lys Leu Glu Leu Val Pro Val 
20 25 30 

Pro Gin Ser Ala His Gly Asp Phe Tyr- Val Gly Asp Ala Tyr Leu Val 
35 40 45 

Leu His Thr Ala Lys Thr Ser Arg Gly Phe Thr Tyr His Leu His Phe 
50 55 60 

Trp Leu Gly Lys Glu Cys Ser Gin Asp Glu Ser Thr Ala Ala Ala He 
65 70 75 80 

Phe Thr Val Gin Met Asp Asp Tyr Leu Gly Gly Lys Pro Val Gin Asn 
85 90 95 

Arg Glu Leu Gin Gly Tyr Glu Ser Asn Asp Phe Val Ser Tyr Phe Lys 
100 105 110 

Gly Gly Leu Lys Tyr Lys Ala Gly Gly Val Ala Ser Gly Leu Asn His 
115 120 125 

Val Leu Thr Asn Asp Leu Thr Ala Lys Arg Leu Leu His Val Lys Gly 
130 135 140 

Arg Arg Val Val Arg Ala Thr Glu Val Pro Leu Ser Trp Asp Ser Phe 
145 150 155 160 

Asn Lys Gly Asp Cys Phe He He Asp Leu Gly Thr Glu He Tyr Gin 
165 170 1'75 

Trp Cys Gly Ser Ser Cys Asn Lys Tyr Glu Arg Leu Lys Ala Asn Gin 
180 185 190 

Val Ala Thr Gly He Arg Tyr Asn Glu Arg Lys Gly Arg Ser Glu Leu 
195 200 205 

He Val Val Glu Glu Gly Ser Glu Pro Ser Glu Leu He Lys Val Leu 
210 215 220 

Gly Glu Lys Pro Glu Leu Pro Asp Gly Gly Asp Asp Asp Asp He He 
225 230 235 240 

Ala Asp He Ser Asn Arg Lys Met Ala Lys Leu Tyr Met Val Ser Asp 
245 250 255 

Ala Ser Gly Ser Met Arg Val Thr Val Val Ala Glu Glu Asn Pro Phe 
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Ser Met Ala Met 
275 

Ala Ala Lys Gin 
290 

Glu Arg Lys Ala 

305 - 

Asn Tyr Ser Lys 



Thr Pro lie Phe 
340 

Ser Asp Gly Phe 
355 

Lys Gin lie Pro 
370 

Ala Ala Gin His 
385 

Trp Arg Val Glu 



Gly Glu Phe Tyr 
420 

Arg Gly Gin lie 
435 

Glu Leu Thr Thr 
450 

Gly Gly Gin Ala 
465 

His Leu Leu Ser 



Gly Thr Ser Lys 
500 

Phe Gin Val Arg 
515 

Asp Val Asp Ala 
530 

Leu Pro Gin Asn 
545 

Glu Glu Glu Lys 



Thr Leu Arg lie 
580 



Leu Leu Ser Glu 
280 

He Phe Val Trp 
295 

Ala Met Lys Thr 
310 

Asn Thr Gin He 
325 

Lys Gin Phe Phe 



Gly Lys Val Tyr 
360 

Phe Asp Ala Ser 
375 

Asn Met Val Asp 
390 

Asn Asn Gly Arg 
405 

Gly Gly Asp Cys 



He Tyr Thr Trp 
440 

Ser Ala Phe Leu 
455 

Val Gin He Arg 
470 

Leu Phe Lys Asp 
485 

Lys Gly Gly Gin 



Arg Asn Leu Ala 
520 

Asn Ser Leu Asn 
535 

Ser Gly Tyr He 
550 

Gly Ala Glu Tyr 
565 

Gin Glu Gly Glu 



265 

Glu Cys Phe He 



Lys Gly Lys Asp 
300 

Ala Glu Glu Phe 
315 

Gin Val Leu Pro 
330 

Lys Asp Trp Arg 
345 

Val Thr Glu Lys 



Lys Leu His Ser 
380 

Asp Gly Ser Gly 
395 

He Gin Val Asp 
410 

Tyr He He Leu 
425 

Gin Gly Ala Asn 



Thr Val Gin Leu 
460 

Val Ser Gin Gly 
475 

Lys Pro Leu He 
490 

Ala Pro Ala Pro 
505 

Ser He Thr Arg 



Ser Asn Asp Val 
540 

Trp Val Gly Lys 
555 

Val Ala Ser Val 
570 

Glu Pro Glu Glu 
585 



270 

Leu Asp His Gly 
285 

Ala Asn Pro Gin 



Leu Gin Gin Met 
320 

Glu Gly Gly Glu 
335 . 

Asp Lys Asp Gin 
350 

Val Ala Gin He 
365 

Ser Pro Gin Met 



Lys Val Glu He 
400 

Gin Asn Ser Tyr 
415 

Tyr Thr Tyr Pro 
430 

Ala Thr Arg Asp 
445 

Asp Arg Ser Leu 



Lys Glu Pro Val 
480 

He Tyr Lys Asn 
495 

Pro Thr Arg Leu 
510 

He Val Glu Val 
525 

Cys Val Leu Lys 



Gly Ala Ser Gin 
560 

Leu Lys Cys Lys 
575 

Phe Trp Asn Ser 
590 
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TK-r Q<ar Pro Leu Leu Glu Thr Gin 
Leu Gly Gly Lys Lys Asp Tyr Gin Thr Ser Pro i.eu u 

595 ^00 

Ala Glu ASP His Pro Pro Arg Leu Tyr Gly Cys Ser Asn Lys Thr Gly 
610 615 620 

Arg Phe Val He Glu Glu He Pro Gly Glu Phe Thr Gin Asp Asp Leu 



625 



630 



Ala Glu ASP ASP Val Met Leu Leu Asp Ala Trp Glu Gin He Phe He 
645 



655 



Trp lie Gly Lys Asp Ala Asn Glu Val Glu Lys Lys Glu Ser Leu Lys 
^ ^ ^ 665 



660 



ser Ala Lys Met Tyr Leu Glu Thr Asp Pr6 Ser Gly Arg Asp Lys Arg 



675 



680 



Thr Pro He Val He He Lys Gin Gly His Glu Pro Pro Thr Phe Thr 
690 695 . 

Gly Trp Phe Leu Gly Trp Asp Ser Ser Lys Trp 

705 -710 ''^^ 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Leu Asn His Val Leu Thr Asn Asp Leu Thr Ala Lys Arg Leu Leu His 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
Lys val Tyr Val Thr Glu Lys Val Ala Gin He Lys Gin He Pro Phe 
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1 



5 



10 



15 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 782 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Pro His Arg Pro Ala Pro Ala Leu Leu Cys Ala Leu Ser Leu 

1 . 5 10 .15 

Ala Leu Cys Ala Leu Ser Leu Pro Val Arg Ala Ala Thr Ala Ser Arg 
20 25 30 

Gly Ala Ser Gin Ala Gly Ala Pro Gin Gly Arg Val Pro Glu Ala Arg 
35 40 45 

Pro Asn Ser Met Val Val Glu His Pro Glu Phe Leu Lys Ala Gly Lys 
50 55 60 

Glu Pro Gly Leu Gin He Trp Arg Val Glu Lys Phe Asp Leu Val Pro 
65 10 75 80 

Val Pro Thr Asn Leu Tyr Gly Asp Phe Phe Thr Gly Asp Ala Tyr Val 
85 90 

He Leu Lys Thr Val Gin Leu Arg Asn Gly Asn Leu Gin Tyr Asp Leu 
100 105 110 

His Tyr Trp Leu Gly Asn Glu Cys Ser Gin Asp Glu Ser Gly Ala Ala 
115 120 125 

Ala lie Phe Thr Val Gin Leu Asp Asp Tyr Leu Asn Gly Arg Ala Val 
130 135 140 

Gin His Arg Glu Val Gin Gly Phe Glu Ser Ala Thr Phe Leu Gly Tyr 
145 150 155 

Phe Lys ser Gly Leu Lys Tyr Lys Lys Gly Gly Val Ala Ser Gly Phe 
165 170 175 

Lys His Val Val Pro Asn Glu Val Val Val Gin Arg Leu Phe Gin Val 
180 185 190 

Lys Gly Arg Arg Val Val Arg Ala Thr Glu Val Pro Val Ser Trp Glu 
^ igg 200 205 

Ser Phe Asn Asn Gly Asp Cys Phe He Leu Asp Leu Gly Asn Asn He 
210 215 220 

His Gin Trp Cys Gly Ser Asn Ser Asn Arg Tyr Glu Arg Leu Lys Ala 
225 230 235 240 
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Thr Gin Val Ser Lys Gly He Arg Asp Asn Giu Arg Ser Gly Arg Ala 

245 250 255 

Arq Val His Val Ser Glu Glu Gly Thr Glu Pro Glu Ala Met Leu Gin 

260 265 270 



Val Leu Gly Pro Lys Pro Ala Leu Pro Ala Gly Thr Glu Asp Thr Ala 
275 280 285 

Lys Glu Asp Ala Ala Asn Arg Lys Leu Ala Lys Leu Tyr Lys Val Ser 
290 295 300 

Asn Glv Ala Gly Thr Met Ser Val Ser Leu Val Ala Asp Glu Asn Pro 
305 310 315 320 

Phe Ala Gin Gly Ala Leu Lys Ser Glu Asp Cys Phe He Leu Asp His 
325 330 335 

Gly Lys Asp Gly Lys He Phe Val Trp Lys Gly Lys Gin Ala Asn Thr 
340 345 350 

Glu Glu Arg Lys Ala Ala Leu Lys Thr Ala Ser Asp Phe He Thr Lys 
355 360 365 

Met Asp Tyr Pro Lys Gin Thr Gin Val Ser Val Leu Pro Glu Gly Gly 
370 375 380 

Glia Thr Pro Leu Phe Lys Gin Phe Phe Lys Asn Trp Arg Asp Pro Asp 
385 390 395 400 

Gin Thr Asp Gly Leu Gly Leu Ser Tyr Leu Ser Ser His He Ala Asn 
405 410 415 

Val Glu Arg Val Pro Phe Asp Ala Ala Thr Leu His Thr Ser Thr Ala 
420 42.5 430 

Met Ala Ala Gin His Gly Met Asp Asp Asp Gly Thr Gly Gin Lys Gin 
435 440 445 

He Trp Arg He Glu Gly Ser Asn Lys Val Pro Val Asp Pro Ala Thr 
450 455 460 

Tyr Gly Gin Phe Tyr Gly Gly Asp Ser Tyr He He Leu Tyr Asn Tyr 
465 470 475 480 

Arg His Gly Gly Arg Gin Gly Gin He He Tyr Asn Trp Gin Gly Ala 
485 490 495 

Gin Ser Thr Gin Asp Glu Val Ala Ala Ser Ala He Leu Thr Ala Gin 
500 505 510 

Leu Asp Glu Glu Leu Gly Gly Thr Pro Val Gin Ser Arg Val Val Gin 
515 520 525 

Gly Lys Glu Pro Ala His Leu Met Ser Leu Phe Gly Gly Lys Pro Met 
530 535 540 

He He Tyr Lys Gly Gly Thr Ser Arg Glu Gly Gly Gin Thr Ala Pro 
545 550 555 560 
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Ala Ser Thr Arg Leu Phe Gin Val Arg.Ala Asn Ser Ala Gly Ala Thr 
565 570 575 

Arg Ala Val Glu Val Leu Pro Lys Ala Gly Ala Leu Asn Ser Asn Asp 
580 585 590 

Ala Phe Val Leu Lys Thr Pro Ser Ala Ala Tyr Leu Trp Val Gly Thr 
595 600 605 

Gly Ala Ser Glu Ala Glu Lys Thr Gly Ala Gin Glu Leu Leu Arg Val 
610 615 620 

Leu Arg Ala Gin Pro Val Gin Val Ala Glu Gly Ser Glu Pro Asp Gly 
625 630 635 640 

Phe Trp Glu Ala Leu Gly Gly Lys Ala Ala Tyr Arg Thr Ser Pro Arg 
645 650 655 

Leu Lys Asp Lys Lys Met Asp Ala His Pro Pro Arg Leu Phe Ala Cys 
660 665 670 

Ser Asn Lys lie Gly Arg Phe Val lie Glu Glu Val Pro Gly Glu Leu 
675 680 685 

Met Gin Glu Asp Leu Ala Thr Asp Asp Val Met Leu Leu Asp Thr Trp 
690 695 700 

Asp Gin Val Phe Val Trp Val Gly Lys Asp Ser Gin Glu Glu Glu Lys 
705 710 715 720 

Thr Glu Ala Leu Thr Ser Ala Lys Arg Tyr lie Glu Thr Asp Pro Ala 
725 730 735 

Asn Arg Asp Arg Arg Thr Pro lie Thr Val Val Lys Gin Gly Phe Glu 
740 745 750 

Pro Pro Ser Phe Val Gly Trp Phe Leu Gly Trp Asp Asp Asp Tyr Trp 
755 760 765 

Ser Val Asp Pro Leu Asp Arg Ala Met Ala Glu Leu Ala Ala 
770 775 780 

INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Thr Lys Leu Ser Ala Gin Val Lys Gly Ser Leu Asn lie Thr Thr 
15 10 15 

Pro Gly Leu Gin He Trp Arg He Glu Ala Met Gin Met Val Pro Val 
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20 



25 



30 



Pro Ser Ser Thr Phe Gly Ser Phe Phe Asp Gly Asp Cys Tyr lie lie 
35 40. 45 

Leu Ala lie His Lys Thr Ala Ser Ser Leu Ser Tyr Asp lie His Tyr 
50 55 60 

Trp He Gly Gin Asp Ser Ser Leu Asp Glu Gin Gly Ala Ala Ala He 
65 70 75 80 

Tyr Thr Thr Gin Met Asp Asp Phe Leu Lys Gly Arg Ala Val Gin His 
85 90 95 

Arg Glu Val Gin Gly Asn Glu Ser Glu Ala Phe Arg Gly Tyr Phe Lys 
100 105 110 

Gin Gly Leu Val He Arg Lys Gly Gly Val Ala Ser Gly Met Lys His 
115 120 125 

Val Glu Thr Asn Ser Tyr Asp Val Gin Arg Leu Leu His Val Lys Gly 
130 135 140 

Lys Arg Asn Val Val Ala Gly Glu Val Glu Met Ser Trp Lys Ser Phe 
145 150 155 160 

Asn Arg Gly Asp Val Phe Leu Leu Asp Leu Gly Lys Leu He He Gin 
165 170 175 

Trp Asn Gly Pro Glu Ser Thr Arg Met Glu Arg Leu Arg Gly Met Thr 
180 185 190 

Leu Ala Lys Glu He Arg Asp Gin Glu Arg Gly Gly Arg Thr Tyr Val 
195 200 205 

Gly Val Val Asp Gly Glu Asn Glu Leu Ala Ser Pro Lys Leu Met Glu 
210 215 220 

Val Met Asn His Val Leu Gly Lys Arg Arg Glu Leu Lys Ala Ala Val 
225 230 235 240 

Pro Asp Thr Val Val Glu Pro Ala Leu Lys Ala Ala Leu Lys Leu Tyr 
245 250 255 

His Val Ser Asp Ser Glu Gly Asn Leu Val Val Arg Glu Val Ala Thr 
260 265 270 

Arg Pro Leu Thr Gin Asp Leu Leu Ser His Glu Asp Cys Tyr He Leu 
275 280 285 

Asp Gin Gly Gly Leu Lys He Tyr Val Trp Lys Gly Lys Lys Ala Asn 
290 295 300 

Glu Gin Glu Lys Lys Gly Ala Met Ser His Ala Leu Asn Phe He Lys 
305 310 315 320 

Ala Lys Gin Tyr Pro Pro Ser Thr Gin Val Glu Val Gin Asn Asp Gly 
325 330 335 

Ala Glu Ser Ala Val Phe Gin Gin Leu Phe Gin Lys Trp Thr Ala Ser 
340 345 350 
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Asn Arg Thr Ser 
355 

Lys Val Glu Gin 
370 

Gin Val Ala Ala 
385 

Gin Val Trp Arg 



Trp Leu Gly His 
420 

Tyr Leu lie Gly 
435 

Ser Gin Ala Ser 
450 

lie Leu Asp Gin 
465 

Met Gly Lys Glu 



Val Val Tyr Gin 
500 

Pro Ser Thr Arg 
515 

Lys Ala Phe Glu 
530 

Val Phe Val Leu 
545 

Gly Cys Ser Gly 



He Ser Arg Thr 
580 

Asn Phe Trp Met 
595 

Arg Leu Gin Glu 
610 

Ser Asn Lys Thr 
625 

Gin Asp Asp Leu 



Gin Val Phe Phe 
660 



Gly Leu Gly Lys 
360 

Val Lys Phe Asp 
375 

Gin Gin Lys Met 
390 

He Glu Asn Leu 
405 

Phe Tyr Gly Gly 



Glu Lys Gin His 
440 

Gin Asp Glu He 
455 

Lys Tyr Asn Gly 
470 

Pro Pro His Leu 
485 

Gly Gly Thr Ser 



Leu Phe Gin Val 
520 

Val Pro Ala Arg 
535 

Lys Thr Gin Ser 
550 

Asp Glu Arg Glu 
565 

Glu Lys Gin Val 



Ala Leu Gly Gly 
600 

Glu Asn Leu Val 
615 

Gly Arg Phe Leu 
630 

Glu Glu Asp Asp 
645 

Trp He Gly Lys 



Thr His Thr Val 



Ala Thr Ser Met 
380 

Val Asp Asp Gly 
• 395 

Glu Leu Val Pro 
410 

Asp Cys Tyr Leu 
425 

Tyr Leu Leu Tyr 



Thr Ala Ser Ala 
4 60 

Glu Pro Val Gin 
475 

Met Ser He Phe 
4 90 

Arg Thr Asn Asn 
505 

Gin Gly Thr Gly 



Ala Asn Phe Leu 
540 

Cys Cys Tyr Leu 
555 

Met Ala Lys Met 
570 

Val Val Glu Gly 
585 

Lys Ala Pro Tyr 



He Thr Pro Arg 
620 

Ala Thr Glu He 
635 

Val Phe Leu Leu 
650 

His Ala Asn Glu 
665 



Gly Ser Val Ala 
365 

His Val Lys Pro 



Ser Gly Glu Val 
400 

Val Asp Ser Lys 
415 

Leu Leu Tyr Thr 
430 

Val Trp Gin Gly 
445 

Tyr Gin Ala Val 



He Arg Val Pro 
480 

Lys Gly Arg Met 
495 

Leu Glu Thr Gly 
510 

Ala Asn Asn Thr 
525 

Asn Ser Asn Asp 



Trp Cys Gly Lys 
560 

Val Ala Asp Thr 
575 

Gin Glu Pro Ala 
590 

Ala Asn Thr Lys 
605 

Leu Phe Glu Cys 



Pro Asp Phe Asn 
640 

Asp Val Trp Asp 
655 

Glu Glu Lys Lys 
670 
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Ala Ala Ala Thr 
675 

Arg Asp Pro Glu 
690 

Pro Thr Phe Thr 
705 

Asn Thr Lys Ser 



Asp Trp Ser Gin 
740 

Phe Asn Ala Asn 
755 

Leu Glu Gin Leu 
770 

Asp Pro Ser Arg 
785 

Ala Phe Gly Met 



Gin Gin Asn Leu 
820 



Thr Ala Gin Glu 
680 

Thr Pro He He 
695 

Gly Trp Phe Leu 
710 

Tyr Glu Asp Leu 
725 

He Thr Ala Glu 



Ser Asn Leu Ser 
760 

Val Asn Lys Pro 
775 

Lys Glu Glu His 
790 

Thr Pro Ala Ala 
805 

Lys Lys Glu Lys 



Tyr Leu Lys Thr 



Val Val Lys Gin 
700 

Ala Trp Asp Pro 
715 

Lys Ala Glu Ser 
730 

Val Thr Ser Pro 
745 

Ser Gly Pro Leu 



Val Glu Glu Leu 
780 

Leu Ser lie Glu 
795 

Phe Ser Ala Leu 
810 

Gly Leu Phe 
825 



His Pro Ser Gly 
685 

Gly His Glu Pro 



Phe Lys Trp Ser 
720 

Gly Asn Leu Arg 
735 

Lys Val Asp Val 
750 

Pro He Phe Pro 
765 

Pro Glu Gly Val 



Asp Phe Thr Gin 
800 

Pro Arg Trp Lys 
815 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - "PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GATGCGGATC CAAYGAYYTN ACNGCNCA 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
. (A) DESCRIPTION: /desc = "PRIMER" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
GATGCATCGA TACRTGNGCN ACYTTYTC 



( 2 ) INFORMATION_FpR ^ S^EQ I D^NO : IJ : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
CTCGAGGGTG GCGACGACTC C 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GCGGCCGCTT GACACCAGAC CAA 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHAFIACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



66 



CAGCTATGAC CATGATTACG CCAA 



24 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
■(B) TYPE: nucleic acid 
" (C) STlOVNDEDNESSr~sirigle' 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ACGACGGCCA GTGAATTGCG TAAT 2 4 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Lys Val Ala Lys Val Glu Gin Val Lys Phe Asp Ala 
15 10 
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Sequence Listing 
SEQ ID NO: 
Sequence length: 
Sequence Type: 
5 Topology: 

Molecule type: 
Sequence description : 

SEQ ID NO: 
10 Sequence length: 
Sequence Type: 
Topology : 
Molecule type: 
Sequence description : 

15 

SEQ ID NO: 
Sequence length: 
Sequence Type: 
Topology: 
20 Molecule type: 

Sequence description: 

SEQ ID NO: 
Sequence length: 
25 Sequence Type: 
Strandeness: 
Topology : 
Molecule type: 



1 

12 

amino acid 
linear 
peptide 
KVAHVKQIPFDA 

2 
9 

amino acid 
linear 
peptide 
VLTNDLTAQ 

3 
5 

amino acid 
linear 
peptide 
ITNRK 

4 

2418 

nucleic acid 
double 
1 inear 
cDNA 
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Sequence characteristic : 

Symbol Showing Characteristic: mat peptide 
Location: 27. . 2171 

Sequence description: 
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0QG003GAAC ATCG07IGCC CGAGTC ATC GCXT CAG GOG CTC TAC CAC 47 

Met Ala Gin Gly Leu Tyr His 
1 5 

GAG GAG TIC GCC OSC GOG G3C AAi3 OCXS GCG GGG CIG GTC TGG AGA 95 
Glu Glu Phe Ala Arg Ala Gly Lys Arg Ala Gly Leu Gin Val Trp Arg 
10 15 20 

ATT aAG AAG CIG GftG CTC GIG COG GIG OCC Q^C AGC GCG lAT GGC AAC 143 
He Glu Lys Leu Glu Leu Val Pro Val Pro Glu Ser Ala Tvr Glv Asn 
25 30 35 

TIC TAC GTC G3G GAT GCC T?^ CIG GIG CIC C^y; ACG AiDG Qi^G GCC AGC 191 
Phe Tyr Val Gly A^ Ala Tyr Leu Val Leu His Tnx Ihr Gin AJLa Ser 
40 45 50 55 



OQ3 GGC TIC ACC TAC OGC CIG CAC TIC T3G CIG GGA AJ^ GAG TGT ACT 
Arg Gly Phe Ihr Tyr Arg Leu Kis Phe Trp Leu Gly Lys Glu Cys Ihr 
60 65 70 
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QAG GAT GAA AGC ACA GCA GCT GCC ATC TIT ACT GTT CAG ATG GAT QAC 287 
Gin A^ Glu Ser Ibr A^a A^a Ala He Phe Tnr Val Gin bfet Asp Asp 
75 80 85 " " 

ThT TIG GST GSC AAA OCT GIG CAG AJ«2 AGA GAA CIT CAA GGC TAT QAG 335 
Tyr Leu Gly Gly Lys Pro Val Gin A^. Ajrg Glu Leu Gin Gly Tyr Glu 
90 95 100 

TCT AOG GAT TIT GTT GGC m: TIT AAA G3A GGT CIG AJA 'mc AAG GCT 383 
Ser Thr Asp Phe Val Gly Tyr Phe Lys Gly Gly Leu Lys lyr Lvs Ala 
105 110 115 

GGC GGT GTG GOG TCT GGA CIC AAT CAT GIG CTT ACA AAT GAC TIG ACT 431 
Gly Gly Val AJLa Ser Gly Leu Ajsn His Val Leu Thr A^ Asp Leu Thr 
120 125 130 ^ 135 

GCT CAG AGG CIC CIG CAT GTG A_AA GGT.Q3G AGA GTC GIC AGG GCC ACG 479 
AJLa Gin Arg Leu Kis Val Lys Gly Ajrg Arg Val Val Arg Ala Thr 
140 145 150 

GAA GTT CCC CIA AGC TGG GAC AGT TIC AAC AAG GGT GAC TGC TIC ATC 527 
Glu Val Pro Leu Ser Trp Asp Sex Phe Asn Lys Gly Asp Cys Phe He 
155 160 155 

ATT GAC CTT GGC ACT GAA ATT TAC CAG TGG TGT GGA TCT TCT TGC AAC 575 
He A^ Leu Gly Thr Glu He Tyr Gin Trp Cys Gly Ser Ser Cys Asn 
170 175 ' 180 
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A;^ GAG CGC CIT3 AAG GCC ;>GC 0.3 CTT GCC ATC GGC ATT CGG GAC 

Lys TVr Glu Arg Lys Ala Ser Glx. Val Ala He Gly He Arg Asp 
185 190 
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AAT GAA AGG AAA G3C AG;>. GCT CAG CIG ATT GIC C7m av. GAA G3= 671 
Asn Glu Arg Lys Gly Arg AJa Gin Leu He Val Val Glu Glu Gly Ser 

205 210 215 

GAA CCA ICA a^G CTT .ACA AJ^ GI- TT^. QGG GA.:^. .^13 CCA AAG CTT AGG 7ia 
Glu Pro Ser Glu Thr Lys Val Leu Gly Glu Lvs Pro Lvs Leu x^-xi 

225 - "230 

GAT GGA G.T G^T GAT G^r ATC A^A GCA G^T AT?. NTT A^T Ai3G AAA 767 
Asp Gly Glu Asp A^ Asp A^ He Lys A^a A^ He Ihr A^ ^rg Lvs 
235 240 245 



AT3 GCr AAA CIC m: ATC GTT TCA GAT GCC AGT GGC ICC ATG AA^ GIG 
Met Ala Lys I^a TVr ^fet Val Ser Asp Ala Ser Gly Ser ^fet Lys vLl 

255 260 

AGT CIG GIG GCA aAA GAA A^ CCC TIC ICC ATG GCG ATC CTT CIG TCP 
Ser Val AJ.a Glu Glu A^. Pro Phe Ser Ifet Ala Jfet Leu Lsu ^ 
265 270 275 

GAA QAA TGC TIC ATT TIG GAG CAC OCT GCT GCA AAA. CAG ATT TIT Gm 
Glu Glu Cys Phe He A^ His Gly AJ.a AJ.a Lvs Gin He Ph- Val 

285 290 " ' ^235 

103 AA^ GGT A-A^ GVT GOT AAT CCC CPG GPG AGA AAi3 GOT GCC An^ AAG 
Trp Lys Gly Lys Asp Ala Asn Pro GLn Glu Arg Lys Ala Ala Ifet Lvs 
. 305 " 



815 



863 



911 



959 



AHA GOT GAG GAA TIC Cm CAG CA^ ATC AAT lAT ICT AAT ACC CAA 1007 
li^r Ala Glu Glu Phe Glx. Gl_n Kfet Asn lyr Ser Thr Asn TV^ Gin 
315 320 325 

ATT CAA GIT CTT OCA GAA GCA GGT CA^ A£A CCA ATC TIC AAA CAG T^ 10-5 
He Gin val Pro Glu Gly Gly Glu Thr Pro He Hie Lvs Gin Pho 
330 335 340 

TIT AAG GAC TCG AC:. CAT AC.:. GAT CAi3 AiX GAT GGC TTC GC33 AAA GTC 1103 
Phe Lys A.SP 1^ Arg A^ Arg Asp Gin Ser A3P Glv Phe Gly Lvs Val 
345 350 355 



TATGTCACAGAAAAAGTCGCTCACGIAAAAC^ATrCCATITGATGCC HSl 
TVr Val Thr Glu Lys Val Ala His Val Lys Gin He Pro Phe Asp Ala 

375 



360 365 .370 
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TCA AAA TIG CAC AGC TCC CCA CAA ATG GCA GCC CAG CAT CAC GTC GIG 1199 
Ser Lys Leu His Ser Ser Pro Gin Met Ala Ala Gin His His Val Val 
380 385 390 

GAT GAC GGT TCT GGC AAA GTC CAG ATT T3G CGT GTA GAA AAC PMl GGT 1247 
Asp A^ Gly Ser Gly Lys Val Gin He Trp Arg Val Glu Asn Asn Gly 
395 400 405 

AGG GTC GAA ATT GAC OGA AAC TOG TAT GGT GAA TIC. TAT GGT GGT GAT 1295 
Arg Val Glu He Asp Axg Asn Ser Tyr Gly Glu Phe IVr Gly Gly A-SD 
410 415 420 

TGC TPJZ ATT A.TA CTT TAC AjCT TAT CCC AjGA GGA CAG ATT ATC TAjC AjGC 1343 
Cys Tyr He He Leu TVr Ihr Tyr Pro Arg Gly Gin He He TVr Thr 
425 430 435 

TGG CAA GGA GCA AA^T GCC ACA OGG GA.T GAG CTG ACA ACC TCC GCA TTC 1391 
Trp Gin Gly Ala A-sn AJ_a Tnx Arg A^ Glu Leu Thr Thr Ser Ala Phe 
440 445 450 455 

CIG ACT GIT CAG TIG CAT AjGA TCC CTC OGG GCA CAG GCT GIG CAG ATT 1439 
Leu Thr Val Glxi Leu A-sp Arg Ser Leu Gly Gly Gin Ala Val Gin He 
460 465 470 

CGA GTC TCC CAA GGC AAA CAA CCT GOT CAC CIG CIG AGT TTG TTC AAA 1487 
Arg Val Ser G3xl Gly Lys Glu Pro Ala His Leu Leu Ser Leu Phe Lys 
475 480 485 

GAG AAA CCG CTC ATT A.TT TAC AAG AAC GGA ACA TCA AAG AAA CAA GGT 1535 
A^ Lys Rro Leu He He Tyr Lys Asn Gly Thr Ser Lys Ly-s Glu Gly 
490 495 500 

CAG GCA CCA GCC CCC CCT ATA GGC CTC TTT CAA GIC CCA ACA AAC CIG 1583 
Gin Ala Pro Ala Pro Pro He A-rg Leu Phe Gin Val Arg A-rg A-sn Leu 
505 510 515 

GCT TOG ATC ACC ACA ATT ATG GAG GTA CAT GTT CAT GCA AAC TCA TTG 1631 
Ala Ser He Thr A^g He Met Glu Val A^ Val Asp Ala A^ Ser Leu 
520 525 530 535 

AAl? TCC AAT GAT GTT TTT GTC CIG AAA. CTG CCA CAA AAT AAT GGC TAC 1679 
Asn Ser A-sn A^ Val Phe Val Leu Lys Leu Arg Gin Asn Asn Gly Tyr 
540 545 550 

ATC TGG ATA GGA AAA GGC TCC ACA CAG GAG GAG GAG AAA GGA GCA GAG 1727 
He Trp He Gly Lys Gly Ser Thr Gin Glu Glu Glu Lys Gly Ala Glu 
555 560 565 
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TACGTCGCAAGCGTCCnCAAATOCAAA^^CTTOG^ 1775 
Tyr Val Ala Ser Val Leu Lys Cys Lys Tnr Sex Tnr lie Gin Glu Gly 
570 575 580 

AAG GAA CCA GAG GAG TIT T3G AA.T TCC CTT GGA G33 AAA AAA GAC TAG 1823 
Lys Glu Pro Glu Glu Phe Trp A^ Ser Leu Gly Gly Lys Lys Asp Tyr 
585 590 595 

CPG ACC TCT OCT GIG CTA GAA. TCC C^jG GCT GAA GAC CAT CCA CCT CGG 1871 
Gin Tnr Ser Pro Leu Leu Glu Ser Glxi Ala Glu A^ His Pro Pro Arg 
600 605 510 615 

CTT TAC G3C T3C TCC AAC AAA. ACT GGA AjGA TIC ATT ATT GAA QAG GIT 1S19 
Leu Tyr Gly CVs Ser A^ Lys l^ir Gly Arg Phe lie He Glu Glu Val 
620 625 630 

CCA GGA aAG TIC ACC CAG GAT GAT TTA GCA GAA GAT GAT GTC ATG CIG 1967 
Pro Gly Glu Phe Thr Gin Asp Asp Le^ AJLa Glu A-sp Asp Val Met Leu 
635 540 645 

TTA GAT GCr T3G QAA CAG ATT TIT ATT TGG ATT GCA AAA GAT* GCC AAT 2015 
Leu A-sp AJ.a Trp Glu Gin He Phe He Trp He Gly Lys A-so Al.a A^ 
650 655 660 

GAA GTT CAG AAA TCA GAA TCP CIG AAG TCT GCC AAA ATA TAC CTT CAG 2063 
Glu Val Glu Lys Ser Glu Ser Leu Lys Ser Ala Lys He Tyr Leu Glu 
665 670 675 

ACC CAC CCT TCT GCA ACA CAC AAG AGG ACG CCA ATT GIC ATC ATA AAA 2111 
Tnr Asp Pro Ser Gly Arg Asp Lys Arg Thr Pro He Val He He Lys 
680 585 690 695 

CAG GGT CAT CAG CCA CCT ACT TIC ACA Q3C TG3 TIC CIG G3C IGG GAT 2159 
Gin Gly Kis Glu Pro Pro Thr Phe Thr Gly Tcp Phe Leu Gly Trp A-sp 
700 705 " 710 

TCC AGC AGG TGG TAAACICATT TTIGTAGCAA AAAAACAAAT A.TAATGGGGC 2211 
Ser Ser A-rg Trp 
715 

AjGCTGIOCCA GG333CAAG3 AGGAGdTGT TTAACTITAG AAAATTAACC TCAGCCATAT 2271 

GGCTATTTTT COGIGCTIAG AATTGGTTT3 AAATnCTTT TAAACIGGAA TITICTTATG 2331 

TTAATATITT TATAACITTT CTEATGCACC AAIATT^GCT CIGCIGGATG CICACATATG 2391 

TTEATATATG ACTTTTTAAA GGGGCOG 2418 
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SEQ ID NO: 5 

Sequence length: 2630 

Sequence Type: nucleic acid 

Strandeness : double 

5 Topology: linear 

Molecule type: cDNA 

Sequence characteristic: 

Symbol Showing Characteristic: mat peptide 
Location: 79. . 2223 

10 Sequence description: 
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AAQj-lH-UiC CTOZTCCTUr OGGTTIJGrc CAAGATCAGC 40 



GATATC?DGC GTCCOXDGA GCATCGCGTG GyGGAGCC ATG GOG CGG G?C CEA TT^JZ 96 

Met Ala Arg Glu Leu lyr 
1 5 



CAC GAA a^J3 TTC GCC C3G GOG G3C AAG O^G GCG GGG CTG CAG GTC TGG 144 
His Glu Glu ?he Ala Ary Ala Gly Lys Gin Ala Gly Lsu Gin Val Trp 
10 15 20 

AGG ATf G?n AAG CIG GAG CIG GIG CXX GIG CCC C?^ AGC GCT CAC GGC 192 
Arg lie Glu Lys Lea Glu Leu Val Pro Val Pro Gin Ser Ala I-iis Gly 
25 30 35 

GAG TIC 1^ GTC G3G QAT GCC TAG CIG GIG CIG CAC AGG GCC A.AG ACG 240 
Pisp Phe Tyr Val Gly Asp Ala Tyr Leu Val Leu His Ihr Ala Lys Thr 
40 45 50 



AGC OGA GGC TIC ACC TAG CAC CIG CAC TIC TGG CIC GGA AAG QAG 1GT 288 
Ser Arg Gly Phe Ihr TVr His Lsu His Phe Trp Leu Gly Lys Glu CVs 
55 60 65 70 

TCC CAG GAT (lAA AGC ACA GCT GCT GCC A1C TIC ACT GTT CAG AIG GKT 336 
Ser Gin A^ Glu Ser Ihr Ala AJ.a Ala lie Phe Ihr Val Gin Met A^ 
75 80 85 

QAC -mT TIG G3T GGC A-AG CCA GIG CAG AAT ACA GAA CIT CAA GGA TAT 384 
Asp Tyr Leu Gly Gly Lys Pro Val Gin Asn Arg Glu Lsu Gin Gly Tyr 
90 95 100 

GAG TCT AAT CAC TIT GTT AGC TAT TIC AAA GGC GGT CIG AAA TAG AAG 432 
Glu Ser Asn Asp Phe Val Ser Tyr Phe Lys Gly Gly Lsu Lys Tyr Lys 
105 110 115 

GCT GCA GGC GIG GCA 1CT GCA TTA AAT CAT GTT CIT AOG AAC CAC CIG 480 
AJLa Gly Gly Val Ala Ser Gly Leu Asn His Val Leu Ihr A3n Asp Leu 
120 125 130 



ACA GCC AAG z^GG CTC CTA CAT GT3 AAG GGT OGT AGA GT3 GIG ACA GCC 
Thr Ala Lys Arg Leu Leu His Val Lys Gly Arg Arg Val Val Arg Ala 
135 140 145 150 



523 



ACA GAA GTT GCC CTT AjGC TGG GAG AGT TIC AAC AAG GGT GAC TGC TIC 
Thr Glu Val Pro Leu Ser Trp Asp Ser Phe Asn Lys Gly Asp Cys Phe 
155 160 165 



576 
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ATC ATT GAC CIT GGC .^iZC GAA ATT TAT CAG T33 TCn- GGT -nx: TCG TCC 624 
He He Asp Leu Gly Thr Glu He Tyr Gin Trp Cys Gly Ser Ser Cys 
170 175 180 

AAC AAA TAT a=A CGT CTC AA3 GCA AAC GAG GIA GCT ACT GGC ATT CGG 672 
Asn Lys Tyr Glu Arg Leu Lys Ala Asn Gin Val Ala Ihr Gly He Arg 
185 190 195 

T^ AAT GAA PO^ AAA GGA K33 TCT GAA CTA ATT GIG GIG GAA GAA GGA 720 
Tyr Asn Glu Arg Lys Gly Pcrg Ser Glu Lsu He Val Val Glu Glu Gly 
200 205 210 

AGT GAA CrC TCA GAA CTT ATA AAG GTC OGG QAA AAG CC\ G^n CTT 768 
Ser Glu Pro Ser Glu Leu He Lys Val Leu Glv Glu Lys P^o Glu Leu 
215 220 225 230 

CCA GAT GGA GGT GAT GAT GAT GAC ATT ATA GCA GAC ATA AGT AAC AJ3G 816 
Pro A^ Gly Gly Asp Asp A-sp A^ He He Ala Asp He Ser Asn Arg 
235 240 245 

AAA AT3 GCT A^A CIA TAG ATG GIT TCA GAT GCA .AGT GGC TCC ATG AGA 864 
Lys Ifet Ala Lys Leu Tyr Met Val Ser Asp AJLa Ser Gly Ser >fet Ajcg 
250 255 260 

GIG ACT GIG GIG GCA GAA GAA AJ^C CCC TIC 1CA AIG GCA ATC CIG CIG 912 
Val Unr Val Val Ala Glu Glu A^sn Pro Phe Ser Ifet AJ.a Met Leu Leu 
265 270 275 

TCT GAA GAA 1GC TIT ATT TIG GAC CAC GGG GCT GCC AAA CAA ATT TIC 960 
ger Glu Glu C^s Phe He Leu Asp His Gly Ala Ala Lys Gin He Phe 
280 285 290 

cm 1Q3 AAA GGT AAA GAT GCT AAT CCC CAA GAG AG3 AAG GCT GCA ATG 1008 
Val Trp Lys Gly Lys A.sp Ala Asn Pro Gin Glu Arg Lvs Ala Ala l<fet 
295 300 305 " 310 

AAG ACA GCT GAA GAA TTT CTA CAG CAA ATG AAT TAT TOT AAC AAT ACC 1056 
Lys Tnr Ala Glu Glu Phe Leu Gin Gin Met Asn Tyr Ser Lys A^ Thr 
315 320 ' 325 

CAA ATT CAA GTT CTT CCA GAA GGA GGT GAA ACA CCA ATC TTC AAA CAG 1104 
Gin He Gin Val Leu Pro Glu Gly Gly Glu Thr Pro He Phe Lys Gin 
330 335 340 

TTT TTT AAG GAC TGG AGA GAT AAA GAT CAG AGT GAT GGC TTC GGG AAA 1152 
Phe Phe Lys Asp Trp Arg Asp Lys Asp Gin Ser Asp Gly Phe Gly Lys 
345 350 355 
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GTT TAT GTC ^CA G?G AAA GIG GCT CAA ATA CAA ATT OX TTT GAT 1200 

Val Tyr Val Ihr Glu Lys Val Ala Gin He Lys Gin He Pro Phe Asp 
350 365 370 

GCC TCA AAA TEA CAC AGT TCT COG CAG A1G GCA GCC GAG CAC AAT ATG 1248 
Ala Ser Lys Leu His Ser Ser Pro Glxi Met Ala Ala Gin His Asn Met 
375 380 . 385 390 

GIG GAT GAT GGT TGT GGC AAJ^ GIG GAG ATT TGG CGT GTA GAA AAC AAT 1296 
Val Asp Asg Gly Ser Gly Lys Val Glu He Trp Arg Val Glu Asn Asn 
395 400 405 

GGT AGG Aire CAA GTT GAC CAA AAC TCA T?IT GGT GAA TIC TAT GGT GGT 1344 
Gly Arg He Gin Val A-sp Gin Asn Ser Tyr Gly Glu Phe TVr Gly Gly 
410 415 420 

GAC TCC TAC ATC Am CIC TAG ACC TAT CCC AX^ GGA CAG AITT ATC TAC 1392 
Asp Cys Tyr lie lie Leu Tyr Ihr Tyr Pro Arg Gly Gin He He Tyr 
425 43 0 435 

AjCG TGG CAA GGA GCA AAT GCC ACA CGA GAT CAG CIG ACA AjCA TCT C<G 1440 
Thr Trp Gin Gly A-la Ajsn AJLa Thr Ajcg Asp Glu Leu Ihr Thr Ser AJLa 
440 445 450 

TIC CIG ACT GTT CAG TIG GAT CGG TCC CTT GCA GGA CAG GZT GIG CAG 1488 
Phe Leu Thr Val Gin Leu Ajsp Arg Ser Leu Gly Gly Gin Ala Val Gin 
455 460 465 470 

A1C CGA GIC TCC CAA GGC AAA CAG CCT GIT CAC CTA CIG AGT TIG TTC 1536 
He Arg Val Ser Gin Gly Lys Glu Pro Val His Leu Leu Ser Leu Phe 
475 480 485 

AAA GAC AAA COG CIC ATT ATT TAC AAG AAT GCA ACA TCA AAC AAA GGA. 1584 
Lys A-sp Lys Pro Leu He He Tyr Lys A^ Gly Ihr Ser Lys Lys Gly 
490 495 500 

GGT CAG GCA CCT GCT CCC CCT ACA GGC CIC TTT CAA GIC CGG AGA AAC 1632 
Gly Gin AJLa Pro AJ.a Pro Pro Thr Arg Leu Phe Gin Val Ajrg Arg Asn 
505 510 515 

CIG GCA TCT ATC ACC AGA ATT GIG GAG GTT GAT GTT GAT GCA AAT TCA 1680 
Leu Ala Ser He Thr Arg He Val Glu Val Asp Val Asp Ala Asn Ser 
520 525 530 

CIG AAT TCT AAC GAT GTT TGT GTC CIG AAA CIG CCA CAA AAT AGT GGC 1728 
Leu Asn Ser Asn Asp Val Cys Val Leu Lys Leu Pro Gin Asn Ser Gly 
535 540 545 550 



- 55 - 



TACATCTO3GTAGGA;^JV^GGTGCrAGCQ^G?CGAGGAGAAAGGAGC^ 1776 
Tyr lie Tcp Val Gly Lys Gly Ala Ser Gin Glu Glu Glu Lys Gly Ala 
555 560 565 

GAG TAT GTA GCA AGT GTC CIA AAG TCC AAA A£X: TEA AGG ATC CAA GAA 1824 
Glu Tyr Val Ala Ser Val Leu Lys Cys Lys TlTr Leu Arg He Gin Glu 
570 575 580- 

GGC GAG GAG CCA GAG GAJ3 TIC TCG AAT ICC CIT GGA GGG AAA AAA GAr 1872 
Gly Glu Glu Pro Glu Glu Phe Trp A^n Ser Lea Gly Gly Lys Lys hsp 
585 590 595 " ' 

TAG CAG ACC TCA OCA. CIA GIG GAA ACC CAG GOT QAA &C CAT CCA CCT 1920 
TVr Gin Ihr Ser Pro Leu Leu Glu Thr Gin Ala Glu A^ His Pro Pro 
600 605 610 

CGG CTT TAC GGC 1GC TCT AAC AAA ACT GGA AGA TTT GTT A.TT GAA GAG 1968 
Arg Lfiu Tyr Gly Cys Ser Asn Lys Thr Gly Arg Phe Val He Glu Glu 
615 620 625 630 

ATT CCA GGA CAG TIC ACC CAG CAT GAT Tm GCT GAA GAT GAT C?IC ATC 2016 
He Pro Gly Glu Phe Thr Gin A^ A^ Leu AJ.a Glu A^ ksp Val Met 
635 640 645 

TTA CI3^ GAT GCr T3G GAA CAG ATA TIT ATT T3G ATT GGC AAA CAT GCP 2064 
Leu Leu ?^sg Ala Trp Glu . Gin He Phe He Trp He Gly Lys A^ Ala 
650 655 660 

AAT GAA GTT GAG AAA AAA CAA lUT CIG AAG TCT GCC AAA ATG TAC CTT 2112 
A^ Glu Val Glu Lys Lys Glu Ser Leu Lys Ser Ala Lys tfet lyr Leu 
665 670 675 

CAG ACA CAC CCT ICT GGA AGA GAC AAG AGG ACA OCA ATT GIC ATC ATA 2160 
Glu Ihr Asp Pro Ser Gly Ajng Asp Lys Arg Ihr Pro He Val He He 
680 685 690 

AAA CAG GGC CAT CAG CCA CCC AjCA TIC ACA GGC TGG TIC CIG GGC TOG 2208 
Lys Gin Gly His Glu Pro Pro Thr Phe Thr Gly Tcp Phe Leu Gly Trp 
695 700 705 710 

CAT TCC AGC AAG TGG TAAATTGGTA TITGTAAAAA GCAAACAAAC ATIACAAGGC 2263 
Asp Ser Ser Lys Trp 
715 

AGTEAICICA TTGCTGTTTr G3GAGAGGAA 033GAAAAGC ' I ' l ' i ' i ' lU- ' l ' lA I ' i ' iUiL ' iTlT 2323 
GAAAATI7AG GCIQ3G0G0G GTOGCTCACA CCIGTAATCC CAGCACTTTG AGAGGATCAG 2383 
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GTPGGOjGAT CACIDGGGTC PGGATTTCGA GACCAGOCTG GCCAACATGG 03AAACCICG 2443 

CCTCTRCn^A AAAl^JZ^AAA .^;\A'mGCTG CGCGHJblU^ TCCACGCCTG 'D^GTCCCIGC 2503 

•EVrnOGAAG GCIG?J3AC^ GAAAATTGCT TCAGCOZ;^ AGGCTGftGGT TCCAGIG^ZSC 2563 

OI3GATIGCG (XACCACACT CCAGCCIG3G CAACSGSGAC TCICTCTCAA .WAAAAAAA 2623 
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SEQ ID NO: 
Sequence length: 
Sequence Type: 
Topology : 
5 Molecule type: 

Sequence description : 

SEQ ID NO: 
Sequence length: 
10 Sequence Type: 
Topology : 
Molecule type: 
Sequence description : 



6 

16 

amino acid 

linear 

peptide 

LNKVLTNDLTAKRLLH 
7 

16 

amino acid 

linear 

peptide 

KVYVTE KVAQ I KQ I P F 
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